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Abstract 

Improving the structure and analysis in [1], we give a variation of the pairing 
heaps that has amortized zero cost per meld (compared to an O(loglogn) in [1]) 
and the same amortized bounds for all other operations. More precisely, the new 
pairing heap requires: no cost per meld, O(l) per find-min and insert, O(logn) per 
delete-min, and O (log log n) per decrease- key. These bounds are the best known for 
any self-adjusting heap, and match the lower bound proven by Fredman for a family 
of such heaps. Moreover, our structure is even simpler than that in [1]. 

1 Introduction 

The pairing heap [5] is a self-adjusting heap that is implemented as a single heap-ordered 
multi-way tree. The basic operation on a pairing heap is the linking operation in which 
two trees are combined by linking the root with the larger key value to the other as its 
leftmost child. The following operations are defined for the standard implementation of 
the pairing heaps: 

• find-min. Return the value at the root of the heap. 

• insert. Create a single-node tree and link it with the tree of the heap. 

• decrease-key. Decrease the value of the corresponding node. If this node is not the 
root, cut its subtree and link the two resulting trees. 

• meld. Link the two trees representing the two heaps. 

• delete-min. Remove the root of the heap and return its value. The resulting trees are 
then combined to form a single tree. For the standard two-pass variant, the linkings 
are performed in two passes. In the first pass, called the pairing pass, the trees are 
linked in pairs from left to right (pairing these trees from right to left achieves the 
same amortized bounds). In the second pass, called the right-to-left incremental- 
linking pass, the resulting trees are linked in order from right to left, where each 
tree is linked with the tree resulting from the linkings of the trees to its right. Other 
variants with different delete-min implementation were given in [2, 3, 5]. 



'Supported by an Alexander von Humboldt Fellowship. 



1 



Table 1: Previous results for upper bounds on pairing- heap's operations 





insert 


delete-min 


decrease-key 


meld 


Fredman et al. [5] 


O(logn) 


0(log n) 


O(logn) 


0(log n) 


Stasko and Vittcr [12] 


O(l) 


0(log n) 


O(logn) 


0(log n) 


Iacono [7] 


O(l) 


0(log n) 


O(logn) 


zero 


Pettie [10] 


O(2 2 v /loglogn ) 


0(log n) 


0(2 2 V' loslosn ) 


O(2 2 v /loglogn ) 


Elmasry [1] 


O(l) 


0(log n) 


0(log log n) 


0(loglog n) 


This paper 


O(l) 


0(log n) 


0(log logn) 


zero 



The original analysis of the pairing heaps [5] showed an O(logn) amortized cost for 
all operations. Another self-adjusting heap that requires O(logn) amortized cost per op- 
eration [11] is the skew heap. Theoretical results concerning the pairing heaps were later 
obtained through the years. Stasko and Vitter [12] suggested a variant that achieves 0(\) 
amortized cost per insert. The bounds for the standard implementation were later im- 
proved by Iacono [7] to: O(l) per inset, and zero cost per meld. Fredman [4] showed that 
fi(loglogn) amortized comparisons, in the decision-tree model, would be necessary per 
decrease-key operation for a family of heaps that generalizes the pairing heaps. Pettie [10] 

proved amortized costs of: O(logn) per delete-min, and O(2 2 v / ' og lo s n ) f or other opera- 
tions. Recently, Elmasry [1] introduced a variant that achieves the following amortized 
bounds: 0(\) per insert, O(logn) per delete-min, and O(loglogn) per decrease-key and 
meld. See Table 1. 

Several experiments were conducted on the pairing heaps, either comparing its per- 
formance with other priority queues [8, 9] or with some of its variants [2, 3, 12]. Such 
experiments illustrate that the pairing heaps are practically efficient and superior to other 
heaps, including the Fibonacci heaps [6]. 

In this paper, we give a variation of the pairing heaps that achieves the best known 
bounds for any self-adjusting heap for all operations. Namely, our amortized bounds are: 
zero cost per meld, 0(1) per find-min and insert, O(logn) per delete-min, and O(loglogn) 
per decrease-key. We describe the data structure in Section 2, prove the time bounds in 
Section 3, give possible variations in Section 4, and conclude the paper with some remarks. 

2 The data structure 

Similar to the standard implementation of the pairing heaps, we implement our variation 
as a single heap-ordered multi-way tree. Since we perform the decrease-key operations 
lazily, a pointer to the minimum element is maintained. 

The detailed implementations for various heap operations are as follows: 

• find-min. Return the value of the node pointed to by the minimum pointer. 

• insert. Create a single-node tree and link it with the main tree. Update the minimum 
pointer to point to this node if it is the new minimum. 

• decrease-key. Decrease the value of the corresponding node x. Update the minimum 
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pointer to point to x if it is the new minimum. Add x to the list of decreased nodes 
if it is not a root. 

We use the following procedure in implementing the upcoming operations: 
-clean-up: 

i. Perform the following for every node x in the list of decreased nodes: Cut 
x's subtree and the subtree of the leftmost child of x. Glue the subtree of 
the leftmost child of x in place of it's subtree, and add the rest of x's subtree 
(excluding the subtree of x's leftmost child that has just been cut) to the pool 
of trees to be combined. See Figure 1. 

ii. Arbitrary divide the trees of the pool into groups of O(logn) trees each (except 
possibly for one smaller group). For every group, sort the values of the roots of 
the trees and link the resulting trees in this order such that their roots form a 
path of nodes in the combined tree (make every root the leftmost child of the 
root with the next smaller value). Link the combined trees with the main tree 
in any order. 

• meld. Call clean-up for the smaller heap. Link the trees of the two heaps. Destroy 
the smaller heap. Update the minimum pointer to point to the root if it has the 
minimum of the melded heap. 

• delete-min. Call clean-up. Apply the standard two-pass implementation of the 
pairing heaps [5]. Make the minimum pointer point to the root of the resulting tree. 



3 Analysis 

We prove the following theorem that implies the claimed time bounds: 

Theorem 1 Starting with an empty heap, consider a sequence of operations S — (01,02,...) 
Let A = {i I Oi is a meld operation}, B = {i \ Oi is a find-min or an insert operation}, 
C = {i I Oj is a decrease-key operation} , and D = {i \ oi is a delete-min operation}. The 
sequence S can be executed on our pairing heaps in 0(\B\ +X)ieC l°gl°g n i + SieD ^°E n i)> 
where Hi is the number of elements that are in the heap at operation i and will leave the 
heap while performing S. 
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For the sake of the analysis, we categorize the nodes as follows. A node is black if it 
will remain in the heap after performing the sequence of operations under consideration, 
otherwise it is white. A black node whose descendants are all black is called an inactive 
node. Let w(x) be the number of white descendants of a node x, including x if it is white. 

1. Inactive nodes: Every node x with w(x) = 0. 

2. Active nodes: Other nodes. 

To bound the cost of the heap operations, we use a combination of the potential function 
and the accounting methods [13]. 

3.1 The potential function 

Consider the link between a node x and its parent p(x) . Let w' (x) be the number of white 
descendants of p(x) restricted to the subtrees of the right siblings of x, including p(x) if it 
is white. We use the potential function 



Despite the fact that the potential on a link may reach logn, the sum of potentials on 
a path from a node z to any of its descendants telescopes to at most log w(z). If the path 
is the left spine of the subtree of z, the sum of potentials telescopes to exactly log w(z). 



Consider the following two cases: 

• a white node is inserted in a heap with an active root. 

• two heaps with active roots are melded. 

To fulfill the potential requirements, 0(^ ieD lognj) units are borrowed from the al- 
lowable cost for the delete-min operations that will be performed on the white nodes. The 
following lemma illustrates that these debits are enough to cover the above two cases. 

Lemma 1 Consider the heap at any time during the execution of the sequence of oper- 
ations S. Let D' = {i | Oj is a delete-min operation that will be performed on a node 
currently in the heap}. The sum of the potentials on the links formed by insert or meld 
operations is at most J2ieD< l°§ n i; where in is the number of elements that are in the heap 
at operation i and will leave the heap while performing S. 

Proof. Let r be a tree representing a heap that has k > white nodes at this point 
of time. Let D' T be the set D' restricted to the operations performed on the nodes of r, 
and P T be the sum of the potentials on the links of r formed by insert or meld operations. 
We prove by induction the stronger fact that P T < X^=i logi. Since all the white nodes 
will eventually be deleted, then J^ i=1 logi < J2ieD< \°E n i- Consider an insert operation, 
where a white node is linked to r resulting in the tree r'. The required potential on this 



link is log (k + 1). By induction, P T > < log (k + 1) + J2i=i 1°S* = J2i=i logi- Consider a 




w{x)>0 



log 




3.2 Debits 
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meld operation, where two trees t\ and ti with active roots are linked resulting in tree r'. 
Assume that n and T2 have fci , ki > white nodes, respectively. The required potential 
on this link is at most log (fci + k-i). By induction, P T > < log (fci + k 2 ) + Ejii 1°S* + 
E*=i l°gi < E 4 =t fc2 lo g«- This follows from the fact that fci! + fc 2 ! < (fci + fc 2 - 1)!, for 
any integers fci, fc 2 > 1. □ 

3.3 Credits 

We maintain the following credits in addition to the potential function: 

- Decrease credits: O(loglogn) credits for every decreased node since the previous 
clean-up is performed. 

- Heap credits: 0(log n') credits per heap, where n' is the size of this heap. 

- Active-parent credits: 0(1) credits for every child of an active node. 

- Active-run credits: 0(1) credits for every active node with an inactive right sibling. 

3.4 The time bounds 

Next, we analyze the time bounds for our operations. Each operation must maintain the 
potential function, the credits, and pay for the work it performs. 

3.4.1 find-min 

No potential or credit changes are required. The actual work of find-min is O(l). It follows 
that the worst-case cost of find-min is 0(1). 



3.4.2 insert 

If the inserted node is white, extra potential units may be needed. But, as Lemma 1 
illustrates, these units are borrowed from the logarithmic cost per delete-min, and the 
insert operation need not pay for that. 

Assume that as a result of the insert operation node x is linked to node y. If y is active, 
the active-parent credits need to be increased by 0(1). If x is active, and the previous 
leftmost child of y was inactive, the active-run credits need to be increased by 0(1). Since 
the size of the heap increased by one, the heap credits need to be increased by O(l). The 
decrease credits need to be increased by 0(loglog(n + 1) — log log n) per decreased node, 
which still sums up to 0(1) as indicated by the following proposition. 

Proposition 1 n ■ (log log (n + 1) — log log n) = O(l). 

Proof. For n > 2, 

n + 1 

n ■ (log log (n + 1) — log log n) < n ■ log 

n 

= log(l + -)" 
n 

But (1 + i) n < e, where e is the base of the natural logarithm. □ 

The actual work to link an inserted node with the main tree is O(l). It follows that 
the amortized cost of insert is 0(1). 
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3.4.3 decrease-key 

No potential changes are required. The decrease-key pays O(loglogn) credits for the 
decreased node. The actual work it performs is 0(1). It follows that the amortized cost 
of the decrease-key operation is O (log log n). 

3.4.4 clean-up 

First, consider the effect of a cut performed on a decreased node x: 

Consider the path of nodes from the root including all the ancestors of x followed by 
the nodes on the left spine of x's subtree. Since we cut the subtree of x and replace it 
with the subtree of its leftmost child, the nodes of the above path remain the same except 
for x. If all the descendants of x are black, possibly excluding the subtree of its leftmost 
child, then the potentials on all the links do not change as a result of the cut. Otherwise, 
all the ancestors of x before the cut are active. In such case, the proof given in [l]can be 
applied, indicating that the sum of the potential on all the links does not increase. 

If x and both its left and right siblings are active while its leftmost child is inactive, 
then the number of active-runs increases by one, and 0(1) credits would be needed and 
paid for from the released decrease credits. 

Second, consider the effect of combining the trees and linking them with the main tree: 
The trees of a group are combined by sorting the values in their roots and linking them 
accordingly in order. Since the size of a group is O(logn), the actual work done in sorting 
is paid for from the released decrease credits (O(loglogn) credits per node). This will 
result in a new path of links. Since the sum of the potential values on a path telescopes, 
the increase in potential as a result of combining the trees of a group and then linking this 
group to the main tree is O(logn). This O(logn) potential increase is also paid for from 
the decrease credits, except for possibly the last group. (The last group may be a smaller 
group, and its decrease credits may not be enough to pay for the increase in potential.) 

As a result of a link the number of active-runs and active- parents may increase by one, 
and O(l) credits would be needed and again paid for from the decrease credits. 
It follows that the overall amortized cost of the clean-up procedure is O(logn). 

3.4.5 meld 

As for insert, extra potential units may be needed. But, as Lemma 1 illustrates, these 
units are borrowed from the logarithmic cost per delete-min. 

The cost of the clean-up performed on the smaller heap is O(logn'), where n' is its 
size. Since the size of the combined heap is at most twice the size of the larger heap, the 
heap credits for the combined heap need to be incremented by O(l). Similar to insert, the 
active-parent credits and the active-run credits may need to be increased by 0(1). The 
actual work for meld, other than the clean-up of the smaller heap, is 0(1). All these costs 
are paid for from the heap credits of the smaller heap, before it is destroyed. 

It follows that the meld operation pays nothing; everything is taken care of by others. 

3.4.6 delete-min 

We think about the two-pass pairing as being performed in steps. At the i-th step, the 
pair of trees that is the i-th pair from the right among the subtrees of the deleted root 
are linked, then the resulting tree is linked with the combined tree from the linkings of all 
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Figure 2: A delete-min ((cj / bi) / a{) step. 



the previous steps. Each step will then involve three trees and two links. Let cii be the 
tree resulting from the linkings of the previous steps, and let Ai be the number of white 
nodes in a,. Let bi and be the i-th pair from the right among the subtrees of the deleted 
root to be linked at the i-th step, and let B t and Cj respectively be the number of white 
nodes in their subtrees. It follows that A i+i = Ai + Bi + d. Let (n / Ti) denote the 
tree resulting from the linking of tree t\ to tree t-i as its leftmost subtree. See Figure 2. 

We distinguish between four cases, according to the types of the roots of bi and a and 
who wins the comparison. 

1. Both roots are inactive: 

There was no potential on the two links that were cut, and no potential is either 
required on the new links. The actual cost of this step is paid for from the released 
active-parent credits, as these two roots were children of an active parent and at least 
one of them is not any more. 

2. An active root is linked to an inactive root, and ((• • • /■■■)/ ai): 

The potential that was related to the active root before the operation is enough to 
cover the potential of the new link with ai. If the leftmost child of the inactive root 
was inactive before the link, the active-run credits need to be increased by 0(1). As 
for the previous case, these possibly-needed extra credits and the actual cost of the 
step are paid for from the released active-parent credits. 

3. (a) Both roots are active: 

The active-run credits may need to be increased by 0(1). 

The potential on the two links that are cut at the i-th step was 



log 



Aj + Bj 
Bi 



+ log 



Aj + Bj + a 

Ci 



We consider the four possibilities: 



i. {(a / bi) / ai ): 

The potential on the new links is 



log 



Bi + Ci 



+ log 



A t + Bj + d 
Bi + Ci 



= log 



A t + Bj + d 
Ci 
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The difference in potential is 



, Bi Bi + d 

l0g I-T^ <l0g -4- 



ii. {{bi / a) / at): 

The potential on the new links is 



B t + a A l + B l + C t Ai+Bi + d 

l°g 5 \- log 5 = log ■ 



Bi Bi + Ci B. 

The difference in potential is 

, Ci Bi + Ci 

l0g I-T^ <l0g -4-- 

hi. (ai /" (c< /" k)): 

The potential on the new links is 

Bi + Ci Ai + Bi + Ci 
log — ^ h log ■ 



Ci Ai 

The difference in potential is 

. {Bi + Ci)-Bi B t + C t 

l0g A-(A-T^) <2l0g -4- 

iv. {a t / {bi / a)): 

The potential on the new links is 

. Bi + d Ai + Bi + d 
l0g -B— +1 ° g Ai • 

The difference in potential is 

{Bi + Ci) ■ Ci Bi + d 

lQ g A 1 A T~> \ < 2 lOg ■ 



A t ■ {Ai + Bi) ° Ai 

(b) One root is active and the other is inactive, and {ai /{■■■/■■ ■)): 
The active-run credits may need to be increased by 0(1). 
Since either Bi or d equals zero, we use = max {Bi, d} for the other value. 
The potential on the cut links is 

. Ai + Mi 
fog . 

8 Mi 

The potential on the new links is 

A t + M, 



log- 



A l 

The difference in potential is 



, Mi Bi + d 

log — = log ■ 
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- If Bi + d < Ai/2, then log ( *X ) < -1- Then, for all the above sub-cases, 

the change in potential is less than —1. This released potential is used to pay 
for the possibly-required increase in the active-run credits, in addition to the 
actual work done at this step. 

- If Bi + Ci > Ai/2, we call this step a bad step. For all the above sub-cases, the 

change in potential resulting from all bad steps is at most 2 log Bi \ Ci (taking 
the summation for positive terms only, i.e. Bi + d > Ai). Since Ay > Bi + d 
when i' > i, the sum of the changes in potential for all steps telescopes to 
O(logn). It remains to account for the actual work done at the bad steps. 
Since A i+ \ = Ai + Bi + C», a bad step results in A i+ i > \A t . Then, the num- 
ber of bad steps is 0(log n). It follows that the increase in the active-run credits 
and the actual work done at bad steps is 0(log n) for each delete-min operation. 



4. An inactive root is linked to an active root, and ((• • • /"•••) / ai): 

The potential that was related to the active root before the operation is enough to 
cover the potential of the new link with a%. To cover the actual work done in such 
step, consider the two steps that follow it. If those two steps are of the same type 
as this step, the number of active-runs decreases (at least one inactive node is taken 
out of the way of two active-runs) and such released credits are used to pay for all 
three steps (this is similar to Iacono's triple-white notion in his potential function 
[7]). Otherwise, one of those two steps will pay for the current step as well. 

From the above case analysis, it follows that the amortized cost of the delete-min 
operation is O(logn). 

4 Variations 

The main difference between our implementation and the standard implementation of the 
pairing heaps is the clean-up procedure. We chose to perform the clean-up before the 
delete-min operation, and to apply it to the smaller heap before the meld operation. The 
following variations are as well possible: 

• It is possible to periodically perform the clean-up, once the number of decreased 
nodes reaches 0(logn) following a decrease-key operation. This assures that when 
the clean-up is performed prior to delete-min operations, there will be only O(logn) 
decreased nodes (one group). 

• It is possible not to call clean-up prior to meld operations, and to do all the work 
prior to delete-min operations instead. 

• In [4], Fredman stated that the cost of m pairing-heap operations, including n 
delete-min operations, is 0(m\og 2m / n n). This bound implies a constant cost for 
the decrease-key operation when m = fl(n 1+e ), for any constant e > 0. This suggests 
that, when the number of the decreased nodes is large enough, we perform the clean- 
up by cutting each of the affected subtrees and directly linking it with the main tree 
(similar to the standard pairing- heaps implementation). 
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5 Conclusion 

We have given a variation of the pairing heaps that achieves the same amortized bounds 
as Fibonacci heaps, except for decrease-key (which still matches Fredman's lower bound 
for, what he calls [4], a generalized pairing heap). Three important open questions are: 

• Is there a self-adjusting heap that achieves amortized o(loglogn) decrease-key cost? 

• Is it possible that the original implementation of the pairing heaps has the same 
bounds as those we achieve in this paper? 

• Which heap performs better in practice? 
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